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Abstract 

In this paper, sensor selection problems for target tracking in large sensor networks with linear equality 
or inequality constraints are considered. First, we derive an equivalent Kabnan filter for sensor selection, 
i.e., generalized information filter Then, under a regularity condition, we prove that the multistage look- 
ahead policy that minimizes either the final or the average estimation error covariances of next multiple 
time steps is equivalent to a myopic sensor selection policy that maximizes the trace of the generalized 
information gain at each time step. Moreover, when the measurement noises are uncorrected between 
sensors, the optimal solution can be obtained analytically for sensor selection when constraints are tem- 
porally separable. When constraints are temporally inseparable, sensor selections can be obtained by 
approximately solving a linear programming problem so that the sensor selection problem for a large 
sensor network can be dealt with quickly. Although there is no guarantee that the gap between the per- 
formance of the chosen subset and the performance bound is always small, numerical examples suggest 
that the algorithm is near-optimal in many cases. Finally, when the measurement noises are correlated 
between sensors, the sensor selection problem with temporally inseparable constraints can be relaxed to 
a Boolean quadratic programming problem which can be efficiently solved by a Gaussian randomization 
procedure along with solving a semi-definite programming problem. Numerical examples show that the 
proposed method is much better than the method that ignores dependence of noises. 
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1 Introduction 



Over the past twenty years, advances in sensor technologies have led to the emergence of large numbers of 
low-cost sensing devices with a fair amount of computing and communication capabilities. Large sensor 
networks have attracted much attention both from theoretical and practical standpoints and have become a 
fast-growing research area. To efficiently manage large sensor networks, one typically designs a policy for 
determining the optimal sensor network performance and resource utilization at each time, within logical or 
budget constraints. The most comprehensive recent survey on sensor management is provided in the book 
Ifn . Discussion on more advances in this area is available in the recent survey paper ||2] and references 
therein. In this paper, we concentrate on sensor selection problems in which a subset of sensors are selected 
at each time instant while tracking a target that provides optimal performance-resource usage tradeoffs. 

The sensor selection problem arises in various applications, including target tracking, e.g., ||3]lll^ robotics 
||5l, and wireless networks ||6]. Sensor selection for the target tracking problem will be considered here. In 
the literature, the sensor selection problem has been formulated for different dynamic systems. In [3|, the 
state model was assumed to be deterministic without noise. A convex optimization procedure was developed 
based on a heuristic to solve the problem of selecting k sensors from a set of m sensors. Although no 
optimality guarantees could be provided for the solution, numerical experiments showed that it performed 
well. Another important contribution comes from the work reported in [4| where the state model was assumed 
random with noise and a general objective function of the sensor selection problem was transformed to a 
quadratic form by introducing the gain matrix as an additional decision variable. However, the resulting 
optimization problem cannot efficiently take advantage of the structure of the covariance of measurement 
noise such as it being a diagonal matrix in the uncorrelated case. In this paper, the sensor selection problem 
formulated by the use of the Moore-Penrose generalized inverse only relies on Boolean decision variables 
without introducing additional decision variables. The resulting optimization problem can efficiently take 
advantage of the structure of the measurement noise and obtain the optimal solution analytically. Many 
other excellent results on sensor selection for state estimation in different situations can be found in, e.g., 
||7l[8l|3[ia[IIl[l2l[l3l[Il[l5l[l6l[l7l[Iland references therein. 

Sensor management problems are often considered with different criteria and objectives. Representa- 
tive approaches for sensor management include optimization of estimation error covariance [H ^J, Fisher 
information ||9][I3, and entropy or mutual information ||5]|ni[l2l[l3l[l4l[l5l[l8l[l9l. Various functions of 
the estimation error covariance and Fisher information matrix, including their determinant and trace, have 
been used as reward functions for optimal sensor management. Several popular measures, including Renyi 
entropy, KuUback-Leibler (KL) divergence, and Hellinger-Battacharya distance, have been used for the cal- 
culation of information gain between two densities. In this paper, based on the Moore-Penrose generalized 
inverse, we will derive a closed-form expression of information gain for sensor selection called generalized 
information gain whose trace function is taken as the reward function for optimal sensor selection. When the 
measurement noises are assumed independent, the notion of information measure based on the information 
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gain has been discussed in the Uterature, see |[T9l . 

In this paper, we consider the problem of state estimation for a linear dynamic system being monitored 
by multiple sensors. For sensor selection, we first derive an equivalent Kalman filter for sensor selection, 
i.e., generalized information filter. Then, under a regularity condition, we prove that the multistage look- 
ahead policy that minimizes either the final or the average estimation error covariance of next N time steps is 
equivalent to a myopic sensor selection policy that maximizes the trace of the generalized information gain at 
each time step. Thus, trace of the generalized information gain is defined as a measure of information that the 
selected sensors provide at each time step. Moreover, when the measurement noises are uncorrelated between 
sensors, the optimal solution can be obtained analytically when the constraints are temporally separable. 
When the constraints are temporally inseparable, the solution of the sensor selection problem can be obtained 
by approximately solving a linear program (LP) so that sensor selections for a large sensor network can be 
performed quickly. Although there is no guarantee that the gap between the performance of the chosen 
subset and the performance bound is always small, numerical examples suggest that the algorithm is near- 
optimal in many cases. Finally, when the measurement noises are correlated between sensors, the sensor 
selection problem when the constraints are temporally inseparable can be relaxed to a Boolean quadratic 
programming (BQP) which can be efficiently solved by a Gaussian randomization procedure along with 
solving a semi-definite programming (SDP) problem which can be solved by interior-point methods [20]. 
Numerical examples show that the proposed method yields solutions that are much better than the method 
that ignores dependence. 

The rest of the paper is organized as follows. Preliminaries are given in Section [2l where the generalized 
information filter for sensor selection and multistage sensor selection problems that minimize either the final 
or the average estimation error co variances over the next N time steps are formulated. In Section |3l under 
a regularity condition, we prove that multistage look-ahead policies are equivalent to the myopic sensor 
selection policy that maximizes the trace of the generalized information gain at each time step. In Section 
m the case of uncorrelated measurement noises is considered. The optimal solution is derived analytically 
for sensor selection when the constraints are temporally separable. When the constraints are temporally 
inseparable, the sensor selection scheme is obtained by approximately solving an LP. In Section |4l the case 
of correlated measurement noises is considered. The sensor selection problem is relaxed to a BQP which can 
be efficiently solved by a Gaussian randomization procedure along with solving an SDP problem. In Section 
m numerical examples are given and discussed. In Section |6l concluding remarks are provided. 

2 Preliminaries 

2.1 Problem formulation 

We consider a surveillance region of interest (ROI) that is being monitored by a sensor field for potential 
targets crossing the ROI. The fusion center tracks the target by optimally selecting a fixed number of sensors 
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from a large sensor network under some logical or budget constraints. Specifically, we consider a L-sensor 
linear dynamic system 



Xfc+i = FfcXfe+Wfc, (1) 
yi = Hi,Xfc + v^, i = l,2,...,L, (2) 
4 = 7iH*fcXfe + 7^v|, i = l,2,...,L, (3) 



where x/, G W,Fk G M'''''' is an invertible matrix^J; y*^ G M*"', H*^ G {w^} and {v*^} are both 

temporally uncorrelated with zero means and invertible covariances Q^, and respectively. The covariance 
of the noise = ((v^)', . . . , (v^)')' is denoted by = Cov(vfc) which is assumed invertible, R^-' = 
Cov(v^, v;^,) so that R^* = R^. If the i-th sensor is selected, we let 7^ = 1, otherwise 7^ = (see, e.g., f4l); 
Ik — ill, ■ ■ ■ , Ik)'- ^hall focus on Equations © and ^ for sensor selection. The stacked measurement 
equation is written as 



Zfc = HfcXfc + Vfe, (4) 



where 



z, ^ iiziy,...,{zj^)y, (5) 

vfc ^ ((7M)',---,(7M)')', (6) 

H, ^ {Uuiy,...,{^i;uj^)y. (7) 

The covariance of the noise is denoted by 

Rfc ^ Cov(v,), fC^ 4 Cov(7i.vL 7^i) = T^^R^ (8) 

Moreover, we denote by zi-k = {z[,.. • ,z'^)', x^i^ = E[xfe|zi:fc], P^ifc = - ^k){^k\k - Xfc)']- 

At time tk, the fusion center has 7^, i = 1, . . . , L, x^i^ and P^i^ (or measurements zi-^). The fusion 
center is to design the sensor selection scheme for the next time steps. At time tk+n, fnk+n sensors will be 
selected from L sensors, for n = 1, . . . , A. They will send their measurements, compressed measurements 
or local estimates to the fusion center. The fusion center makes the final estimates for the state at times tfc+n> 
n = 1, . . . , A. The problem is how to select mk+n sensors from L sensors (i.e. determine the Boolean 
decision variables 7^^„, « = 1, . . . , L), n = 1, . . . , A that minimize the objective function which is 

• either the final estimation error covariance 

/l — ^k+N\k+Nj (9) 



'The invertibility of the transition matrix can be guaranteed in tracking problems, see II21I . 
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• or the average estimation error covariance 

1 ^ 

f2 — J^ ^Pfc+n|fc+n- (10) 

n=l 

The constraint that nif^^n sensors are selected from L sensors, n = I,... ,N induces a constraint that 
is temporally separable. Moreover, we shall also consider constraints that are temporally inseparable, for 
example, energy constraints. 

Note that the objective functions (l9l)-(fT0l) are matrices. Matrix optimization considered here is in the 
sense that if x* is an optimal solution, then for an arbitrary feasible solution x, P{x) ^ P{x*), i.e., P{x) — 
P{x*) is a positive semi-definite matrix (see, e.g., li20J ). 



2.2 Equivalent Kalman filter for sensor selection 

It is well known that the Kahnan filter provides the globally optimal solution if the noises are assumed 
Gaussian, otherwise it provides the best linear unbiased estimate. It is recursive no matter whether the 
covariances of noises are invertible or not and is given as follows (see, e.g., HH), 

Xfe+i|fc+i = + Kfc+i(zfc+i - Hfc+iXfc+i|fc), (11) 

Pfe+1|A;+1 = (I — Kfc+iHfc+i)Pfc+i|fc, (12) 

where I is an identity matrix with compatible dimensions, 

Xfe+i|fc = F/jXfc|fc, (13) 

Kfe+i = Pkj^i\kil'k+i(H.k+i'Pk+i\k^'k+i +'^k+i)~^ , (14) 

Pfc+l|fc = '^k^klk^'k + Qfc- (15) 

The superscript means Moore-Penrose generalized inverse (see, e.g., ||23l )FI. If P f,_^_i^i^, P |^_^_^f,_^_l and 
Rfc+i are invertible (for example, the case that all L sensors are selected), then we have the following 
equivalent Kalman filter 

Xfc+l|fc+l = ^ k+l\k+l(P kll\k^k+l\k + 'H.'k+iP-k+l^k+l) , (16) 
Pfc+llfc+l = (Pfe+l|fc + ^fc+lR-fc+lHfc+l) \ (IV) 

which is usually called the information filter and H'^^^R^^^Hfc_|_i is called the information gain (see e.g., 
ll24l ). Once the sensors are selected, the covariance of the noise vector of the selected sensors is invertible. 

^Here, the Moore-Penrose generalized inverse is used since tlie (Hk+iP k+i\k^'k+i + R-fc+i) rnay not be invertible. The reason 
is that Hfe+i and R^+i defined by Q and {HJ for the sensor selection problem include the decision variables {-/l^i, . . . , 7^+1} 
which have L — nik+i number of zeros. 
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Note that we assumed that Q^, A; = 1, 2, . . . , are invertible, and it is easy to check that Pfc+i|A,. and Pfc+i|fc+i 
which are updated by the Kalman Filter based on the selected sensors are also invertible. Here, however, 
Rfc+i are not invertible, since there are L—irik+i number of zeros in the decision variables {'J^.^^i , • • • , 7^+1 }■ 
Thus, we first prove that, for the dynamic system ((T) and (H) defined under sensor selection where R^+i are 
not invertible, there still exists an equivalent Kalman filter similar to ([T6]l-([r7Ti. 

Theorem 2.1. For the dynamic system defined by ([7]) and (I?]) under sensor selection, we have the following 
equivalent Kalman filter (generalized information filter) 

= Pfc+l|A:+l(Pfe+i|fcXfc+l|fc + H'^.+lR-fc+l^fc+l)' (1^) 

Pfc+i|fc+i = (Pfc+i|fe + Hfc+iR^_^iHfc+i)~-^. (19) 



Proof. See appendix. 

The key difference in Theorem 12.1 l is that R-^^^ where Rfc+i is invertible in (fT6l)-(fr7]) has been replaced 
by R^_^]^ in (fT8l)-([T9l). Due to this difference, H^^^R^^^H^+i will be called generalized information gain. 
Notice that Pfc+i|fc_(_i is a function of ■ ■ ■ , T^+i, since Hfe+i and R^+i are functions of • • • , T^+i- 
Thus, it is denoted by Pk+i\k+i{ll+i, 7fc+i)- Similarly, Pk+n\k+n{ll+i, • • • , Ik+v ■ ■ •> 7^+^, • • • , 7fc+n) 
is a function of -fl^^, . . . , j^^^, . . ., -fl^^, . . ., 7f'_^„, forn = 1, . . . , iV. 

It is the generalized information filter based on Moore-Penrose generalized inverse that helps us decou- 
ple the multistage look-ahead policies to an equivalent myopic sensor selection policy that maximizes the 
generalized information gain with a lower computational complexity in Section |3l Another advantage is 
that the sensor selection problem formulated by the use of the Moore-Penrose generalized inverse only re- 
lies on Boolean decision variables without introducing additional decision variables and can efficiently take 
advantage of the structure of the measurement noise to obtain the optimal solution and efficient algorithms. 

2.3 Optimization problems for sensor selection 

Thus, by using Theorem 12. 1[ the two sensor selection problems can be stated as 

min ^k+N\k+N{lk+li ■ ■ ■ ilk+li ■ ■ ■ >7fc+Ar) • • • Hk+N) 

= {^k+N\k+N-l + Hfc+AfRfc+^Hfc+Af) ^ (20) 
L 

subject to ^ 7fc+„ = nik+n, n = l,...,N, (21) 

i=l 

7i+ne{0,l},i = l,2,...,L, n = l,...,N, (22) 
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and 

min 
subject to 

3 Generalized Information Measure for Sensor Selection 

In this section, we consider some properties of the optimization problems presented in Section [23] that will 
simplify their solution. We will show that if the primal sensor selection problem (|20l ) has an optimal solution, 
then both the problem (l20l ) and the problem (1231 ) can be transformed to equivalent optimization problems that 
maximize an information measure at each time step. 

Lemma 3.1. Consider two optimization problems: 

(Ai) max M(x), (24) 
(A2) max tr(M(x)), (25) 

where M(x) is a matrix for an arbitrary x G S; S specifies the constraint on the decision variable x. If the 
problem (Ai) has an optimal solution, then the problem (Ai) is equivalent to (A2). 



N 



n=l 



Pfc+n|A:+n(7fe+l> • • • ) 7fc+li • • • ) Tfc+AT) • • • ) Iu+n) 

1 

TV 



(23) 



n=l 



^ Ik+n = mk+n, n = l,...,N, 



i=l 



7L„e{0,l},i = l,2,...,L, n = l,...,iV. 



Proof. See appendix. 

Lemma 3.2. Consider two optimization problems: 

(Bi) min M„(xi, x„), /or n = 1, A^, (26) 

Xi£Si,i=l,.--," 

N 

{B2) min Vm„(xi,...,x„) (27) 

x„e<S„,n=l,...,Af, ^ 
n=l 

where M„(xi, . . . , x„) is a function of decision variables xi, . . . , x„,/or n = 1, . . . ,N. If the optimal solu- 
tion that minimizes M„(xi, . . . , x„), (xj, . . . , x* ), is the same as the one that minimizes M„_|_i(xi, . . . , x„_|_i), 
for n = 1, . . . ,N — 1, then the optimal solution that minimizes M7v(xi, . . . , xat) ((Bi) with n = N) is the 
same as that for (B2). 
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Proof. See appendix. 

Based on Lemma [l!2l the solution to both the problem (l20l ) and the problem (1231 ) can be simplified and 
obtained by solving N optimization problems separately. 

Lemma 3.3. If the primal sensor selection problem f |20D has an optimal solution, then both the problem d2QP 
and the problem d2jD can be transformed to the equivalent problem that solves N optimization problems that 
maximize H^_|_^R^^_^Hfc+„, n = 1, . . . ,N respectively, i.e., 

max H'fc+„R+^^Hfc+„ for n = 1, . . . , N, (28) 

L 

subject to ^ 7^,+„ = nik+n, 
1=1 

Ik+n € {0,1}, i = 1,2,. ..,L, 

where 'H.k+n <^f^d Rfc+n are defined in Equations (fT]) and (jS]) respectively; the superscript indicates 
Moore-Penrose generalized inverse A23I/ . That is, the problems f |2QD . f |2jD f |28D /tave rame optimal 
solution. 

Proof. See appendix. 

Remark 3.4. Lemma [33] show s that multistage look-ahead policies, i.e., the problem (|20l ) and the problem 
( [23] ). are equivalent to a myopic sensor selection policy that maximizes the generalized information gain with 
a lower computational complexity. Why do the problem (|20] | and the problem (l23T i with different objectives 
have the same optimal solution? The main reason is that the objectives and constraints are temporally sepa- 
rable. For example, consider mfc+„ = 1, i.e., select one sensor at each time step, if there is a sensor which 
has the smallest noise and provides the most information at each time step, then the selection of the sensor at 
each time step is the optimal sensor selection scheme no matter whether the objective is the final estimation 
error covariance or the average estimation error covarianc e. 

Moreover, based on Lemmas |3 . 1 1 and 1331 we have the following theorem. 

Theorem 3.5. If the primal sensor selection problem f |20D has an optimal solution, both the problem d2QD 
and the problem ( 1251 ) can be transformed to the equivalent problem requiring the solution of N optimization 
problems that maximize tr(H^^^R^^^Hfc+„), n = 1, . . . ,N respectively, i.e., 

max tr(H'fc+„R+^„Hfc+„) for n = 1, . . . ,N, (29) 

L 

subject to ^ 7|._j.„ = ruk+n, 
1=1 

Ik+n e {0,1}, i = 1,2,...,L, 
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where ^k+n cmd Rfc+n cif^ defined in Equations ^ and (H)) respectively. That is, the problems d20D . ([23\l 
and d29P have the same optimal solution. 

Remark 3.6. Theorem 13.51 shows that both the minimization of the final estimation error covariance and 
minimization of the average estimation error covariance are equivalent to maximization of the trace of the 
generalized information gain of each time step. Thus, the objective function 

tr(H;.+„R^+„Hfc+n) (30) 

of the problem (1281 ). i.e., trace of the generalized information gain, is defined as a measure of information 
that the selected sensors provide at (A; + ?i)-th time step. Determinant of the generalized information gain can 
be similarly defined as a measure of information if H'^^^R^^^H,fc+„ is a positive definite matrix. When the 
measurement noises are assumed independent, more infonnation measures based on information gain can be 
formulated, see e.g., |[T9l . 

Furthermore, the information measure (|30) has an advantage that it does not depend on pdfs of the states 
and measurements, but only relies on covariances of noises and the measurement matrices. Maximizing 
this measure can be employed as an alternative criterion for sensor selection, which will be used in sensor 
selection problems when the constraints are temporally inseparable in the following sections. When pdfs 
are known, it is better to try to use the information criteria based on pdfs such as optimization of Fisher 
information, entropy or mutual information for sensor selection (see, e.g., [2]). 



4 Sensor Selection Schemes for Uncorrelated Sensor Measurement Noises 

4.1 Optimal Sensor Selection Scheme for Temporally Separable Constraints 

When sensor measurement noises are uncorrelated and the constraints are temporally separable, we have the 
following result that defines the optimal sensor selection scheme. 

Theorem 4.1. Let the information measure corresponding to the i-th sensor at (k + n)-th time be de- 
noted as = tr((H^^„)' (R*^_^„)-iH^^„), i = 1, . . . , L. Let {a'^^^, . . ., a^^^} denote {a^^^, . . 
Ofc+nl rearranged in descending order. If the problem d20D has an optimal solution, then the optimal 
sensor selection scheme for both the problem d20D and the problem d2jD is 7^1,_„ = 1, . . . ,7jr.™'^" = 1, 
Ik+n^^^ = 0, . . . , 7^^„ = 0, for n = 1, . . . ,N. The optimality of sensor selection scheme is in the sense 
of either the minimization of the covariance of the final estimation error dP]) or the average estimation error 
diOl ) or maximization of the information measure dJOD . If the problem d2QD does not have an optimal solution, 
the optimality of the sensor selection scheme is only in the sense of maximization of the information measure 

m. 



Proof. If the measurement noises are uncorrelated between sensors, then R^+n is a block diagonal matrix 
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with fC^_^_n = 0,iy^j. Thus, 

^t+n = (diag(7fc+„i?fe+„, • • = diag(7fc+„(^i+„)"^ • • • , 7fc+n(^fc+n,)"^) (31) 

which follows from the definition of Moore-Penrose generalized inverse. Moreover, by Theorem 13.51 the 
problem (l20l ) is equivalent to 

rnax tr(H'fc+„R+_^Hfc+„) forn = 1, . . . , A^, 

Tfc + n 

L 

i=l 

L 

subject to ^ 7^+„ = nik+n, 
1=1 

lUn^{0,l},i = l,2,...,L. 

If we define a*,^„ ^ tr((ff,^j'(R^_, J-1h1_, J, z = 1, . . . , L and {4V„, • • J denotes {ai+„, . . ., 
rearranged in descending order, then the optimal solution is 7^Yn = 1) • • • i Ik+n" ~ 1' Tfc+n^"^^ ~ 



4.2 Extension to Temporally Inseparable Constraints 

Many constraints on sensor selection can be represented as linear equalities or inequalities such as logical 
constraints and budget constraints (see, e.g., EH]). Let us denote linear equalities or inequalities as follows 

>bp,p = l,...,P, (32) 

where 

7 = (7fc+n • • • > ik+N)\ Ik+n = (7fc+n> • • • > 7fc+n)'> forn = 1, . . . , iV; (33) 

ap is a vector with a compatible dimension and bp is a scalar; can represent either ">" "<" or "=" 
for each n. In general, these constraints are temporally inseparable, which makes the optimization problems 
with objectives (l20l) and (1231 ) not separable and highly nonlinear in variables 7^^„. The corresponding 
optimization problems are very hard to solve. 

However, from Remark 1331 the trace of generalized information , tr(H'^^,^R^^^HA,.+„), can be defined 
as the measure of information that selected sensors provide. Thus, we can try to maximize the available 
information gain from time A; + 1 to + by optimizing the selection of sensors so that better estimation 
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performance can be expected. We shall consider the following objective (i.e, sum of the weighted information 
measure) 



TV 



/3 ^ ^a;„tr(H',+„R+^„H 



k+n) 



(34) 



n=l 



where Wn, n = 1, . . . , are weights which place different importance on different time steps. For example, 
if the state estimation at the final time is more important, a larger weight uj^f can be used. If the state 
estimation of each time step is equally important, an equal weight structure wi = • • • = ujn can be used. 
Therefore, we consider the following optimization problem for sensor selection: 



N 



max ^w„tr(H;+„R+^„Hfc+„ 



(35) 



subject to 



n=l 

ap7 > bp, p= 1, 
7fc+n e {0,l},i = l,2,...,L,n = l,...,N. 



L, . . . , i , 



Since sensor measurement noises are assumed uncorrected in this section, by Equation (BTI l. the problem 
(|35] ) is equivalent to 

N L 

inax E lUn tr((Hl+ j'(RUn)"'Hl^+n) (36) 

Ik+n n=l i=l 

subject to a'p-y > bp, p = 1, . . . , P, 

7i.+„ G {0,l},i = 1,2,..., L,n = l,...,iV, 

which is a Boolean linear programming (BLP) problem and the optimal objective function value is denoted 
by f%i^p- It can be relaxed by replacing the nonconvex constraints 7^_|_„ G {0, 1} with the convex constraints 
< 7|_,_„ < 1, i = 1,2, ... ,L,n = 1, ... ,N . Thus, we have the following LP problem: 

tr(rD') 



max 

Tfc + n 

subject to 



(37) 



ap7 > bp, p=l,...,P, 
0<7fc+n<l,i = l,2,...,L,n = l,...,iV, 

where 7 is defined by ( [33] ): F is a L x matrix with i-th row and n-th column element being 7fc_|_„, i.e, 

/ 11+2 ' ' ' ll+N \ 



\ 7fe+l lk+2 



11+N 



Ik+N I 



(38) 



and D is a L X matrix 



D 



Wntr((H^_,_„) (R^_,_„) ^^k+n 



(39) 
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It is well known that LP problems can be solved efficiently. The solution of the problem (|37] ) is denoted 
by {lk+n)LP^ i = 1, . . . , L,n = 1, . . . , N. The corresponding objective function is denoted by f^p. Note 
that the feasible solution set of the problem (|37] ) contains that of the problem (l36l ) so that fp^p < flp. 
Based on (7^_,_„)2p> we can generate a suboptimal feasible solution of the problem (l36l ) denoted by 7^^„, 
i = 1, . . . , L,n = 1, . . . , N. The corresponding objective function is denoted by /blp and Jblp < Iblp- 
The difference g = f^p — Jblp i& called the gap in fSl. The gap is useful in evaluating the performance of 
the suboptimal solution 7^_|.„. We can say 7^_,_„ is no more than (7-suboptimal. 

Note that the procedure of generating a feasible solution of the problem (|36l ) from (7|+„)2p is problem 
dependent, i.e., relying on the equalities or inequalities (|32l ) and the Boolean constraint. As an illustration, 
let us consider a representative example. Besides the temporally separable constraints (1211 and (|22] |. we 
consider an energy constraint which is temporally inseparable as follows 



N 



^jU^<mli = l,...,L, (40) 



n=l 



which means that the i-th sensor can only be selected m\ times from time /c + 1 to time k + N (ml, < N), 
for i = 1 , . . . , L. Thus, the specific form of the optimization problem (l37l i with the constraints (|2TI ) and (l40l ) 
can be represented to 

max tr(rD') (41) 

Tfc + n 

subject to ap7 > bp, p = 1, . . . , P, 

0<7i+„<l,^ = l,2,...,L,n = l,...,iV, 



where F and 7 are defined by (1381 ) and (1331 ) respectively; P = N + L, 



l^i^p, • • • , c^Vpj , c„^p — < n — i...,iv, 



^0, p / n, 

hp = ruk+p, "> "means" = ", (42) 
for p = 1, . . . , A^, (corresponds to the constraints (1211): 

6p 4 mj,, "> "means" <", (43) 
forp = A^ + i, i = 1, . . . , L, (corresponds to the constraints (|40]i). 

where 1 and denote L-dimensional vectors with 1 entries and entries respectively and Ij means an L- 
dimensional vector whose i-th entry is 1 others are Os. 

The sensor selection scheme with the energy constraint for uncorrelated sensors is described by the 
following algorithm. 
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Algorithm 4.2 (Sensor selection scheme with the energy constraint for uncorrelated sensors). 



Step 1: Given an optimal solution of (i41 D (7fc_j_„)2p> obtain the optimal objective function /£p. 



• Step 2: Generate a feasible solution of the problem M6\ with the constraints A21 1) . d22l) and fl40D from 
i^k+Jlp as follows. 

We generate the feasible solution based on the importance (weight) of the information of each time step. 
Without loss of generality, assume that oJi < UJ2 < ■ ■ ■ < ojn- Thus, we generate the selection scheme 
from the N-th time step to the first time step. Set the index set of candidate sensors i = {1, . . . , L}. 

— Iteratively generate ^\j^]\^_^for the {N — n)-th time step, n = 0, . . . , [N — 1) as follows: 
for n = : (iV - 1) 

A I 1, if i £ ii, 
iUn-u = ' ' • Jori = l,...,L, (44) 

[0, if I e 12, 

where ii is the index set of the first nik+N-n maximum entries of( {lk+N-n)*LP^ ■ ■ ■' i'^k+N-ri)*LP^ 
in the index set of candidate sensors i and denote i2 = i — ii- Set := — 1, for i G ii. 
Update the index set of candidate sensors i = {i : > 0, i = 1, . . . , L}. 
end 

• Step 3: Output g-suboptimal solution 7|,_|_„, i = l,...,L,?7- = l,...,A^ and the corresponding objec- 
tive Jblp, where g = flp - f^ip is the gap. 

Here, to construct the feasible solution satisfying the constraints dJT), (l22l ) and (|40] |. we employ the 
equation (l44l ). The main computation complexity is in Step 1 where an LP problem needs to be solved. 
Illustrative examples will be presented in Section |6l 



5 Sensor Selection Schemes for Correlated Sensor Measurement Noises 

In this section, for correlated sensor measurement noises, we again determine the sensor selection scheme by 
maximizing the weighted information measure: 



max 



TV 

j;w,tr(H'fc+„R+_^„Hfc+,) (45) 

subject to \> bp, p = I, . . . , P, 

7i+„G{0,l},i = l,2,...,L. 



where the linear constraints are defined in (132) that may include both the temporally separable and inseparable 
constraints. Since sensor measurement noises are correlated, to obtain the optimal solution, an exhaustive 
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search is necessary since Rfc+n has no special structure such as it being a diagonal matrix. For the simplest 

^ f L \ 

case of the temporally separable constraint (|21) and = 1, there are a total of feasible solutions. 

For large L and rrik^n^ such an exhaustive search may not be feasible in real time. Thus, to make the solution 
computationally more efficient, the problem (1451 ) is approximately solved by replacing by R^^„- This 

approximation is lossless for the case of uncorrelated sensor noises and temporally separable constraint (i.e., 
does not change the optimal solution in Theorem 14.11 ). More discussion on approximation loss for different 
dependences will be given in Section |6l Thus, we consider the approximate problem 



AT 



max 



^w„tr(H'fc+„R^^„Hfc+0 (46) 

"/k+n n=l 

subject to a'pj > bp, p = I, . . . , P, 

7i+„G{0,l},f = 1,2,...,L. 



Moreover, from the definition of tlk+n we have 



i=l s=l 
L L 



1=1 



= -7fc+„Bfc+„7fc+„, (47) 

where T^*^^ is the i-th row block and s-th column block of the matrix Tk+n, T^k+n — Rfc+„> the i-th 
row and s-th column of B^+n is — tr((H^^^)'T^''^^H^^^). Thus, the problem is equivalent to solving the 
following Boolean quadratic programming (BQP) problem 



mm 



N 

^ UJn7k+n^k+n7k+n (48) 
Tfc+n n=l 

subject to a'p'y > bp, p = 1, . . . , P, 

7fc+ne{0,l},i = l,2,...,L,n = l,...,A^. 

For this problem, however, it is still hard to obtain an optimal solution, since the nonconvex Boolean con- 
straints and Bfc4_„ may not be a positive semi-definite matrix. It is known to belong to the class of NP-hard 
problems. Fortunately, this class of problems can be solved by a recently developed computationally efficient 
approximation technique (see, e.g., 1251). We apply it to the problem ( |48] ) as follows. 
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By semidefinite relaxation (SDR) technique (see, e.g., If20ll25]| ). the problem (1481 ) can be relaxed to 

min tr(CX) (49) 

xe§(JVi+i), x^o 

subject to tr(EpX) > {Abp - 1' diag(ap)l), p = l,... ,P, 
tr(FsX) = 1, s = l,..., {NL + 1), 

where is a matrix with s-row and s-column Fs{s, s) = 1, others are equal 0, for s = 1, . . . , {NL + 1), 



diag(ap) 1 
1' 



C 



A 



B Bl \ 

^ 1 'B J ' 

B = diag(u;iBfc+i , . . . , u)NBk+N) 



where Bfc_|_„ is defined by (1471) for n = 1, . . . , A^; I and 1 are an identity matrix and a "1" vector with 
compatible dimensions respectively. The problem (|49l ) is an SDP problem. The derivation of the problem 
( |49l ) is given in Appendix. 

Note that the procedure for generating a feasible solution of the problem (1451 ) from the solution of the 
problem (|49l ) is also problem dependent, i.e., relying on the equalities or inequalities (l32l ) and the Boolean 
constraint. As an illustration, let us again consider the representative constraints dJT), (l22l ) and (|40] i. Thus, 
the specific expressions of a-p and bp in the optimization problem (|49l ) are given by (l42l ) and (|43] ). 

Based on the SDP 1491) , a typical Gaussian randomization procedure is used to construct an approximate 
solution to the problem ( |45] ) here (see 1251 ). Thus, we have the following algorithm. 

Algorithm 5.1 (Sensor selection scheme with the energy constraint for correlated sensors). 

• Step 1: Given an optimal solution of the SDP ([49\l X* G S(^^+^), and a number of randomizations S. 

• Step 2: Generate S feasible solutions by Gaussian randomization procedure based on X*: 
for s=l:S 

1. Generate a vector ~ -^(0, X*). Set 7]s = ^^(1 : NL) which means the first NL entries of ^g- 

2. Without loss of generality, assume that uji < UJ2 < ■ ■ ■ < ujn- generate the selection 
scheme from the N-th time step to the first time step. Set the index set of candidate sensors 
i4{l,...,L}. 

* Iteratively generate "fl+N-n sf^^ ~ ^)"^^ ^'"^^ ^t^P> n = 0, . . . , {N — 1) 

for n = 0:{N-l) 

Ik+N-n.s = S n -r • • Jori = l,...,L, (50) 
0, if I e 12, 
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where ii is the index set of the first mk+N-n maximum entries ofris{J{L — n — 1) + 1 : 
J(L — n)) in the index set of candidate sensors i and i2 = i — ii- Set ml. := — 1, for 
i G ii- Update the index set of candidate sensors i = {i : > 0, i = 1, . . . , L}. 
end 

3. Denote Jk+N-n,s = {ll+N-n,s^ ■ ■ • ' lt+N-n,s) ^^^^ 7s = (7fc+l,s, • • • , lk+N,s)- 

end 

• Step 3: Determine s* = argmax^^i 5 /(Ts) where /(•) may be the objective funcrions fi, or /a 
defined in (|9l), diOl ) and (EH) respectively. 

• Step 4: Output 7 = 7s* as the sensor selections of the problem \45\ . 

Note that specific design of the randomization procedure technique is problem dependent. Here, to 
construct the feasible solution satisfying the constraints (|2TI ). (l22l ) and (l40l ). we employ Equation (l50l ). The 
choice of S will be discussed in Section [6l Based on simulations, the randomized solution can often achieve 
a good performance with a small S, which is similar to that in (251. The main computational complexity 
of the algorithm is in Step 1 where an SDP problem needs to be solved. The SDP problem can be solved 
efficiently by using interior-point methods (see, e.g., (201). 

6 Numerical Examples 

In this section, we present a number of illustrative examples. Both uncorrelated and correlated sensor mea- 
surement noise cases are considered. 

6.1 Uncorrelated sensor measurement noises 

We first compare the performance of the approach given in Theorem 14. 1 1 with the one in Joshi an Boyd (31 
and the one in Mo et al. O. 

Example 6.1. Let us consider a dynamic system with L = 40 sensors which are uniformly distributed over 
a square of size 100 m. The parameter matrices and noise covariances for the dynamic system (I!]'-® are 

where r\ and r| are randomly sampled from the uniform distribution in [5, 7] and [10 12] respectively. We 
consider a constraint, i.e., select rrik+n = [1; 5, 10, 15, 20] sensors from 40 sensors at the next time step 
respectively. 
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In Figure [U the traces of the estimation error covariance are plotted for the sensor selection method given 
in Theorem 14.11 the one in Joshi an Boyd Q and the one in Mo et al. (4] respectively. The CPU time is 
plotted in Figure |2]for the three algorithms respectively. Figure [T]shows that the three methods obtained very 
close and similar estimation performance for the numerical example, while Figure |2]shows that the CPU time 
of the method in Theorem 14.1 l is much smaller than that of the one in Joshi an Boyd f3) and the one in Mo et 
al. H. The reason is that the method in Theorem 14.1 l is an analytical solution. In addition, the computation 
time of the three methods is not an increasing function of the number of selected sensors. The reason is 
that when the number of selected sensors increases, the number of the decision variables does not increase 
and the structure of the optimization does not change; only some parameters of the equality constraints are 
changed. 

Moreover, we consider a representative target tracking dynamic system with energy constraints. We 
assume that each target will be tracked in a Cartesian frame. The four state variables include position and ve- 
locity {x, X, y, y) respectively (see e.g., |[24l ). The parameter matrices and noise covariances for the dynamic 
system ©-dUl are 



HI 



1 


T 











1 














1 


T 











1 


1 

















1 






Qa 



I T^/3 rV2 




1,...,L, 



T 








T 



\ 



(53) 



(54) 



where T = 1 s is the sampling interval; Fj^, Qj^, H^, are the same in the following examples. The difference 
is the noise covariance of measurements, R^, in the following examples. Since the algorithm in Joshi an 
Boyd IS that requires the measurement matrix is full-column rank when the measurement matrices of each 
sensor are the same and the one in Mo et al. |@] does not present how to threshold the approximate solution 
to generate a feasible solution satisfying the energy constraints, we evaluate the performance of Algorithm 
I4.2l by comparing with the exhaustive method for a monitoring system which has a small number of sensors 
and using the gap given in the Step 3 of Algorithm l4.2l for a monitoring system which has a large number of 
sensors in the following examples respectively. 

Example 6.2. First, to compare with the exhaustive method, let us consider a relatively small monitoring 
system with L = 9 sensors which are uniformly distributed over a square of size 100 m. The parameter 
matrices of the dynamic system are given in d5jD-fl5?l) where 



r\ 
ri 



(55) 



r\ and r\ are randomly sampled from the uniform distribution in [5, 10]. We consider the optimization 
problem f l?7l ) with temporally inseparable constraint f l?Ol ) and the constraints f l27] ). f l22l ) where N = 3, 
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rrik+n = 2, n = 1, . . . , N and m], = 2, i.e., select 2 sensors from 9 sensors at each time step and select each 
sensor less than twice in 3 time steps. 

In Figure |3j the traces of the final estimation error covariance /i based on the three methods are plotted 
respectively, where r\ and r2 are randomly sampled 50 times. The three methods are 1) the exhaustive 
method that minimizes the final estimation error covariance /i, 2) Al gorithm 14. 2 I that maximizes the weighted 
information measure /s with weights [1/3, 1/3, 1/3] and 3) Algorithm 14.21 that maximizes the weighted 
information measure /a with weights [0, 0, 1] respectively. Similarly, the traces of the average estimation 
error covariance /2 are plotted in Figure |4] In Figure |5] the sum of information measures of time steps 
Yln=i *^(-f^-fc+n-^fc+n-^'«+") plottcd for the two sensor selection schemes respectively. They are obtained 
from 2) and 3) respectively. 

From Figures |3}^5] we have following observations: 

• Figure |3]shows that the trace of the final estimation error covariances obtained from Algorithm 14.2 I with 
two different weights that maximizes the weighted information measure /a are very close to that of 
the exhaustive method. Similarly, Figure |4]shows that the trace of average estimation error covariance 
obtained from Algorithm 14.21 with weights [1/3, 1/3, 1/3] that maximizes the weighted informa- 
tion measure is very close to that of the exhaustive method. These indicate that maximization of 
the weighted information measure /a is a good alternative criterion for minimizing final or average 
estimation error covariance for sensor selection. 

• Moreover, in Figure |3] when the objective is minimization of the final estimation error covariance /i, 
both Algorithm |4!2] with weights [0, 0, 1] and Algorithm |4!2] with weights [1/3, 1/3, 1/3] are near 
optimal for sensor selection. However, Figure |5]shows that the sum of information measures of N time 
steps for Algorithm 14.21 with the weights [1/3, 1/3, 1/3] is larger than that of Algorithm 14.21 with the 
weights [0, 0, 1]. Thus, it is better to choose the weights [0, 0, 1], since a larger sum of information 
measures of N time steps implies that more good sensors are used. 

• Finally, Figure |4] shows that when the objective is to minimize the average estimation eixor covari- 
ance /2, it is better to choose the weights [1/3, 1/3, 1/3] since Algorithm 14.21 with the weights 
[1/3, 1/3, 1/3] is near optimal. 

Example 6.3. Next, let us consider a large monitoring system with L = 20 x 20 = 400 sensors which are 
uniformly distributed in a square of size 100 m. We consider the optimization problem f UZD with temporally 
inseparable constraint d40D and the constraints f l2iD . d22D where N = 5, rrik+n = 10, n = 1, . . . , and 
= 2, i.e., select 10 sensors at each time step from 400 sensors and select each sensor less than twice in 5 
time steps. Moreover, we consider the performance of Algorithm \4.2\f or different cases ofrnk+nfrom 10 to 
100. Obviously, the exhaustive method is infeasible. 

In Figure |6l the upper bound and lower bound of the objective function of the optimization problem ( |37] ) 
are plotted based on 50 Monte Carlo runs. The corresponding gaps, i.e, the upper bound minus the lower 
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bound are plotted in Figure |7] Figures |6]and |7]show that the gaps are very small and Algorithm 14.21 can 
obtain the optimal solution in the sense of maximizing the weighted information measure in many cases, 
although the sensor network is large where the Boolean decision variables are more than 2000. Figure |8] 
presents the gaps as a function of nik+n from 10 to 100. It shows that the gaps are increasing as the number 
of selected sensors. 



6.2 Correlated sensor measurement noises 

In this subsection, we will compare Algorithm 15 . 1 1 with the exhaustive method for a simple problem so that 
the approximation loss can be computed. For this, we assume that only the sensor selection scheme for the 
next step is to be designed, i.e., = 1. For = 5, we will compare Algorithm |5T| with Algorithm I4.2l that 
ignores dependence. In this case, the exhaustive method is infeasible, since we have to enumerate 2.43 x 10^° 
cases. At the end, an example that compares the root mean square error (RMSE) of state estimation based on 
sensor selection is presented. 

Example 6.4. Let us consider L = 25 for the sensor network shown in Figure \T4l Assume that there is a 
jammer signal with a covariance at the position (550 m, 200 m), besides the natural noises v^, z = 
1, . . . ,L which are independent of v^. The jamming signal introduces dependence among measurement 
noises. Thus, the noises at the i-th sensor is given as follows 



v,+ 



1 + Ko 



(56) 



where f3i = jqi^n-; (ij,o th^ distance between the jammer and the i-th sensor; the signal decay exponent 
n = 2, the scaling parameter a = I and different values for the signal power Pq = [1, 3, 6, 10, 12, 15, 20] x 
10^ are used in simulations respectively. Thus, noises of sensors are correlated and the i-th block and j-th 
block of the noise covariance Rfc can be computed by H56\) to be 



Cov{v 



(57) 



where 




ifi =3 



i 7^ J- 



R? 




are used in simulations. Note that the corresponding Pearson 's correlation coefficients between sensors are 
approximately equal to [0.1, 0.2, 0.4, 0.6, 0.7, 0.8, 0.9] corresponding to = [1, 3, 6, 10, 12, 15, 20] x 
10^ respectively. We consider the optimization problem f |45D with temporally separable constraints d2iD . d22D 
where N = 1, ruk+i = 2, i.e., select 2 sensors from 25 sensors at the next time step. 
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In Figure |9j comparisons of the objective function tr(H'^^^R^^j^Hfc+i) of tlie optimization problem 
(|45] l (i.e., the information measure of the {k + l)-th time step) based on the exhaustive method, Algorithm 
15. II and Theorem 14. 1 1 that ignores dependence are plotted for different jammer signal powers respectively. 
We present the performance of Algorithm 15. 1 1 with small numbers of randomizations S" = 20,5 = 100. 
Similarly, comparisons of the traces of the estimation error covariance of (A; + l)-th time step are plotted in 
Figure [TOl 

From Figures l9]-[T0l we have the following observations: 

• For all the three methods, the larger is the signal power of jammer, the smaller is the information 
measure of the (A; + l)-th time step obtained from the selected sensors and the larger is the trace of the 
estimation error covariance at the {k + l)-th time step. 

• Figures |9]-fT0l also show that the exhaustive method yields better results than Algorithm 15. 1 1 with small 
S and the latter is better than the method of Theorem 14.11 that ignores dependence, especially in the 
case of strong dependence (i.e., strong signal power of the jammer). 

• Figures l9]-[T0l indicate that larger the value of 5 is , the closer is the performance of Algorithm 15. II to 
that of the exhaustive method, i.e., the smaller is the approximation loss of Algorithm 15. II 

Example 6.5. Next, let us consider a monitoring system with a large N = 5 and L = 5 x 5 = 25 sensors 
which are uniformly distributed in a square of size 100 m . We consider the optimization problem dj7D with 
temporally inseparable constraint d40D and the constraints f l2jD . d22D where ruk+n = 2, n = 1, . . . , N and 
ml, = 2, i.e., select 2 sensors at each time step from 25 sensors and select each sensor less than twice in next 
5 time steps. 

In Figure [TT] comparisons of the objective function X]^=i '^n ^'^i^'k+n^t+i-i^k+n) of the optimization 
problem (|45] ) (i.e., the sum of the weighted information measure of time steps, /s defined in ( [34l )) based 
on approaches of Algorithm 15.11 and Algorithm |4.2| that ignores dependence are plotted for different jammer 
signal powers respectively. We examine the performance of Algorithm 15.11 as a function of the number 
of randomizations S = 20, S = 100, S = 2000 which are small, compared with the exhaustive number 
2.43 X 10^*^. Similarly, comparisons of the traces of the average estimation error covariances of time steps 
are plotted in Figure [T2l 

Figures [TTI - [T2] show that Algorithm 15. 1 1 with a small value of S is better than Algorithm |4.2| that ignores 
dependence, especially in the case of strong dependence (i.e., strong signal power of the jammer). In addition. 
Figures [TTI - [T2] indicate that larger the value of S is, the better is the performance of Algorithm 15. 1 I than that 
of Algorithm 14 . 2 1 that ignores dependence. 

Example 6.6. Finally, let us consider the L-sensor noise covariance R^, k = 1,2,... which depends on 
the state x^. A frequently made assumption is that larger is the distance between the sensor and the target, 
larger is the noise covariance. However, when we design the sensor selection scheme of next N time steps at 
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time k, we do not know the state ^k+n that we replace it by the state prediction ^k+n\k which is used to 
compute the Rfc+n, n = \, . . . ,N. Specifically, the noise covariance R/t+n is 

R/fc+rt = R/fc+rt + Rfc+nj (58) 

where Rfc+n is the noise covariance from the jammer signal defined in d57D and the signal power of 
jammer Pq = 1.5 x 10^; Rfc+n is a diagonal matrix with the i-th diagonal block defined as follows 

Rf+„=f"f'" ° V^ = 1,...,L, ,n = l,...,iV, (59) 

where ai = 0.05 is a scaling parameter; dt^n is the distance between the target prediction x^+^i^ and 
the i-th sensor. We consider the optimization problem f UZD with temporally separable constraint d40D and 
the constraints d2iD . d22D where N = 5, ruk+n = 2, n = 1, . . . ,N and ml. = 2, i.e., select 2 sensors 
at each time step and select each sensor less than twice in 5 time steps. The initial state of the target is 
(600 m, -20 m/s, 200 m, m/s). 

In Figure [131 RMSE of the state estimates based on 200 Monte Carlo runs is given. We compare Algo- 
rithm |4!2] that ignores dependence with Algorithm 15. 1 1 with S=20, S=100 and S=2000 respectively. 

As far as the RMSE is considered, Figure [13] also shows that Algorithm 15.11 with a small value of S 
is better than Algorithm 14.21 that ignores dependence and that larger the value of S is, the better is the 
performance of Algorithm I5.1l than that of Algorithm I4.2l that ignores dependence. 



7 Conclusion 



In this paper, we have proposed a generalized information filter for target tracking in wireless sensor networks 
where measurements from a subset of sensors are employed at each time step. Then, under a regularity condi- 
tion, we proved that the multistage look-ahead policy that minimizes either the final or the average estimation 
error covariances of next N time steps is equivalent to the myopic sensor selection policy that maximizes the 
trace of the generalized information gain at each time step. When the measurement noises are uncorrelated, 
the optimal solution has been derived analytically for sensor selection with temporally separable constraints. 
For temporally inseparable constraints, the sensor selection scheme can be obtained by approximately solv- 
ing an LP problem. Although there is no guarantee that the gap between the performance of the chosen 
subset and the performance bound is always small, numerical examples showed that the algorithm is near- 
optimal in many cases and the selection scheme for a large sensor network with more than 2000 Boolean 
decision variables can be dealt with quickly. Finally, when the noises of measurements are correlated, the 
sensor selection problem with temporally inseparable constraints was relaxed to a BQP problem which can 
be efficiently solved by a Gaussian randomization procedure by solving an SDP problem which can be solved 
by interior-point methods and related software tools. Numerical examples showed that the proposed method 
is much better than the method that ignores dependence. 
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Future work will involve the generalization from the linear dynamic systems to nonlinear dynamic sys- 
tems. The equivalence between multistage look-ahead optimization policy for sensor management and the 
myopic sensor optimization policy and the corresponding sensor management schemes will be investigated. 
In addition, it can be considered for various applications such as robotics, sensor placement for structures 
and different types of wireless networks. 

Appendix 

The proof of Theorem 12.1 [ 

Proof. Notice that there are mfc+i number of 7^^^ = 1 and L — ruk+i number of Yk+i = so that there 
exists a permutation matrix P such that 



where ■ TUk+i) and Rfc+i(l : m^+i) are the stacked measurement matrices and the covariance of 

noises of the rrik+i selected sensor respectively; is a zero matrix with compatible dimensions. From the 
property of the permutation matrix PxP' = P'xP = I and the definition of Moore-Penrose inverse, we 
have 




(60) 



and 




(61) 



(PRfc+iP') 




Ra:+i(1 : TTlfc+i) 







(Rfc+i(l : mfc+i))-i 





(62) 
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Moreover, by equations (l60l)-(l62]i and repeatedly using the definition of Moore-Penrose generalized inverse 
and the property of the permutation matrix P x P' = P' x P = I, we have the following derivation 



+ 



, p/ ( : mfc+i)Pfc+iifcHfc+i(l : mk+i)') \ ^ 

I J , 



p, / Rfc+i(l : nik+i) + H/,+i(l : mfc+i)Pfc+i|fcHfc+i(l : mfc+i)') \ p 



p/ [ (Rfc+i(l : "ifc+i) +Hfe+i(l : mfc+i)Pfc+i[fcHfc+i(l : mfc+i)') \ p 
I j 



p, / (Rfc+i(l : nik+i) +Hfe+i(l : mk+i)Pk+i\k'H.k+i{'^ ■ mk+i)') ® ] p 
\ j 



/ (Rfe+i(l :mfe+i))-i(I + Hfe+i(l :mfc+i) 0\ 
p' •Pfc+i|fcHfc+i(l : mfe+i)'(Rfc+i(l : mfc+i))"^)"^ 

V 0/ 

( (Rfc+i(l : "^fc+i))""^ \ 











(I + Hfc+i(l : mfe+i)Pfc+i|fcHfc+i(l : mk+i)' (Rk+i{l ■ rrik+i)) ^) 

I 
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p, / (Rfc+i(l : m^+i))"' 



( 1+ ( ^'=+1(1 • "ifc+i)Pfc+i|fcHfc+i(l : mfc+i)' 
11 



P 



P 



(Rfe+i(l \ \ ' 



, ( (Rfc+i(l : mfe+i))-i \ 
/ 



\ 



p' 



(Rfe+i(l \ \ ' 



(Rfc+i(l : mfc+i))-i 



P' + P' ( • "^fc+OPfc+ilfcHfc+iCl : "ifc+i)' j 







P'l (^'=+^(1^^"^'=^^))'' « ](P' + P'PH,^,P,^,,H',^,P'PR+^, 



/ (Rfc+i(l : mfe+i)) 1 > ~+ 1 



P 



V (Rfc+i(l : mfe+i)) 1 0\ ~ 1 



^ 



- R-fc+l(I + Hfc+lPfc+l|fcH;,+ lR^^J 

From Equations (fT4l ) and (1631 ). we have 



Kfc+i — Pfc+ijfcH'^^i (^Rfc+i + tlk+i^k+ilk^'k+i 

= Pfc+ilfeH^+iR-fc+i (l + 'H-k+i'Pk+ilk'H-'k+i^k+i) 
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which yields 
so that 

Kk+il + Kk+iJlk+iP k+i\k^'k+i^k+i = ^k+i\k^'k+i^k+i- 

Moreover, we have 

Kfc+i = (I - Kk+iilk+i)^ k+l\k^'k+i^k+l 

= ^k+llk+ltl'k+i^k+l^ (64) 
where the derivation of (|64] | is based on ([T2l ). 
From Equations ([T2l ) and ([64]), we have 

= -^fc+i|fc+i ~ -^fc+i|fc+i^fe+i^'^'+i' 
= Pfc+i|fc+i ~ H'^'+iI^fc+iHfc+i- 

Thus, we have P^l^^_^^ = P^^,,;- + H'^+iR+^iHfc+i. 

By (dB, ClI) and (gl, 

= (I - Kfc+iHfc+i)xfc_,_i|fc + Pfc+i|fc_|_iH'^,^;^R^_^-,^ZA,.+i, 

= Pfc+l|fc+lPfc+i|A,.Xfc+l|A; + Pfc+l|fc+lH'fc+l^fc+iZA;+l, 
= PA:+l|fc+l{Pfc+l|fe^fc+l|fc + Hfc+iR^+iZfc+i}, 

Thus, we have x^+ii^+i = P k+ilk+iiPkliik^k+ilk + H'^+iRfc+iZfc+i}- 



The proof of Lemma IXTl 

Proof. First, we prove that the optimal solution of (Ai) is also the optimal solution of (^42)- If xi is the 
optimal solution of (Ai), then, for arbitrary x € 5, M(x) ^ M(xi) which yields tr(M(x)) < tr(M(xi)). 
Thus, xi is also the optimal solution of (A2). 

On the other hand, if X2 is the optimal solution of (^2). then, for arbitrary x G 5, tr(M(x)) < 
tr(M(x2)) which implies tr(M(xi)) < tr(M(x2)). Notice that the problem (Ai) has an optimal solution xi 
which yields tr(M(x2)) < tr(M(xi)). Thus, tr(M(xi)) = tr(M(x2)) so that tr(M(xi) - M(x2)) = 0. 
By tr(M(xi) - M(x2)) = and M(xi) - M(x2) >z 0, we have M(xi) = M(x2). Therefore, X2 is also 
the optimal solution of (^i). 
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The proof of Lemma IX2l 

Proof. In one direction: if xj, . . . , x^ is the optimal solution of ((i?i) with n = N), then, for n = 1, . . . , A^, 

Mn{-Kl, . . . ,x*) ^ M„(xi, . . . ,x„) for arbitrary Xj e 5^, i = 1, . . . , n, 

since the optimal solution that minimizes M„(xi, . . . , x„) is the same as that minimizes M„+i(xi, . . . , x„+i), 
forn = 1, . . . , A^. Thus, 

N N 

^M„(xi,. . . ,x*) ^ ^M„(xi, . . . ,x„) for arbitrary x^ G 5^, i = 1, . . . , n, 

n=l n=l 

which yields xj, . . . , x^ is also the optimal solution of {B2). 

On the other hand, assume that xj, . . . ,x|^ is the optimal solution of (i?2)- If x^, . . . ,x^ is not the 
optimal solution of (-Bi), then there exists an optimal solution x^, . . . ,x^ which has a smaller objective 
function value than that of xj, . . . ,x|^. Since the optimal solution that minimizes M„(xi, . . . ,x„) is the 
same as that minimizes M„+i(xi, . . . ,x„+i), x^, . . . ,x^ is the optimal solution of M„(xi, . . . ,Xn,) for 
n = 1, . . . , N. Thus, 

M„(xl , . . . , x;) >r M„(xO, . . . , x° ), n = 1, . . . , iV, 

so that 

N N 



5^M„(xt,...,x:)^^M„(x?,...,xO) 



n=l n=l 

which yields a contradiction. Thus, x|, . . . , x^ is the optimal solution of (Bi). 



The proof of Lemma |331 

Proof. If the problem (l20l ) has an optimal solution, from the fact that any positive definite matrix A Aq 
implies A^^ ^ then we have that the problem (|20l ) is equivalent to solve 



max 



(Pk+N\k+Nilk+1^ ■ ■ ■ )7fc+l, • • • ,lk+N^ ■ ■ ■ >7fc+Ar)) ^ 

= ^k+N\k+N-l + Hfc+ArRfc+^Hfc+AT 
L 

subject to ^ 7^_^„ = ruk+n, n = l,...,N, 

7fc+ne{0,l},i = l,2,...,L, n = l,...,N, 
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which has the same optimal solution. Since the constraints are temporally separable, and by Equation ([TST i 
and the invertibility of F^, it is equivalent to solve the following two problems 

L 

subject to ^ 7|,4.„ = nik+n, n = N, 

-fl+^ G {0,l},i = 1,2,. ..,L, n = N, 

and 



mm 



'Pk+N-l\k+N-l{lk+li ■ ■ ■ >7fc+l, ■ ■ ■ ,7k+N-l? ■ ■ ■ ^Ik+N-l) 

= (Pfc+Ar-l|fe+Af-2 + ^'k+N~l^t+N-l^k+N-l) ^ 
L 

subject to ^ 7^+„ = mk+n, n = 1, . . . , iV, -1, 

i=l 

Yk+n e {0,1}, i = 1,2,...,L, n= l,...,iV,-l. 

Both of them have an optimal solution respectively. 

After A^-step recursive decomposition, the problem (|20] | is equivalent to solve the following N optimiza- 
tion problems 

max H'/,,+„R+_^Hfc+„ forn = 1, . . . , iV, 

L 

subject to ^ 7fc+„ = iTik+n, 

i=l 

7i+„G{0,l},i = l,2,...,L. 

All of optimization problems have an optimal solution respectively. 

Moreover, for the problem (|20) . we consider minimizing Pk+n\k+n Pk+n+i\k+n+i respectively. 
Both of them have a recursive decomposition similar to that of minimizing Pk+N\k+N for the problem (|20] |. 
Thus, we have 

min Pfc+„|fc+n(7fe+i,---,7fc+i'---'7fc+n,---,7fe+n) (65) 

L 

subject to ^ 7fc+^ = mfc+s, s = 1, . . . , n, 

7fc+s G {0, 1}, i = 1, 2, . . . , L, s = 1, . . . , n, 
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and 

mill Pk+n+llk+n+lill+l^ • • • ; 7fc+l' • • • ' 7fc+n+l> • • ■ > 7fc+„+l) (66) 

L 

subject to ^ 7fc+^ = mfc+^, s = 1, . . . , n + 1, 

i=l 

7^, G {0,1},^ = 1,2,...,L, s = l,...,n + l, 

have the same optimal solutions (7^^^)*, i = 1, . . . ,L,s = 1, . . . , n respectively. By Lemma |3.2[ the 
problem (l20l ) is also equivalent to solving the problem (l23T i. Therefore, if the primal sensor selection problem 
(I20I ) has an optimal solutions, both the problem (l20l ) and the problem (1231 ) can be equivalently transformed 
to solve the problem (|28] ). 



The derivation of the problem ([49b 

By (7fc+„)^ = 7fc+„. the problem (|48]l is equivalent to 

min 7'B7 

subject to 7' diag(ap)7 t> bp, p = 1, . . . , P, 

7Une{0,l},i = l,2,...,L,n = l,...,iV, 

where 7 is defined in (1331 ): diag(ap) and B = diag(a;iBfc_|_i, . . . ,a;ivBfc+Ar) are diagonal matrix and diag- 
onal block matrix respectively. If we let t^_^„ = 27|_,_^ — 1 and denote by Tk+n — {Tk+n^ • • • ' ^k+n)' ^'^'^ 
r = (t^4_]^, . . . , T^j^^y, then the problem is equivalent to 

1 



mm — 



:r + l)'B(r + l) (67) 



subject to \^'^ ^ + 1) - P = 1> • • • 1 -P) 

(r|+„)2 = l,i = l,2,...,L,n = l,...,Ar, 

where 1 is a 1 vector with compatible dimensions. Moreover, it is equivalent to 

(r + l)'B(r + l) (68) 



mm 

k-\-rt ' 



, B B 1 W Tk+n 

I'B J I t 



subject to MiagK) Ml ' 1 > 4&p- l'diag(ap) 1, p = 1,...,P, 



1' 

\2 



(r^+„)" = l,i = 1,2,. .. ,L,n = l,...,iV, 



t2 = l. 
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Problem (|67] ) is equivalent to (1681 ) in the sense: if (r*, f*) is the optimal solution to (|68] ). then r* (respectively 
— T*) is an optimal solution to ( [67] ) when t* = 1 (respectively t* = —1). Moreover, the problem is equivalent 
to 



Es is a matrix with s-row and s-column Es(s, s) = 1, others equal 0, for s = 1, . . . , {NL + 1). By 
introducing a new variable X = (r' t)'(r' t) and removing the constraint rankiX.) = 1, the problem ( |69l ) 
can be relaxed to the problem (|49T l. 
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Number of selected sensors 

Figure 1: The traces of the estimation error covariance are plotted as a function of number of selected sensors. 



^^Mo et al's algorithm 
-^Joshi et al's algorithm 
-©-Our algorithm in Theorem 4.1 




Number of selected sensors 
Figure 2: The cpu times are plotted as a function of number of selected sensors. 
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Figure 3: The traces of the final estimation error covariance for 50 Monte Carlo runs. 
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Figure 4: The traces of the average estimation error covariance in 50 Monte Carlo runs. 
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-Alogrithm 4.2 with weights=[l/3 1/3 1/3] 
- Alogrithm 4.2 with weights=[0 1] 




10 



15 20 25 30 35 
The r-th Monte Carlo run, r=l,. . .,50 



40 



45 



50 



Figure 5: The sum of information measures of N time steps for 50 Monte Carlo runs 
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Figure 6: The upper bound and lower bound of the objective function of the optimization problem (|37] ) for 
50 Monte Carlo runs. 
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Figure 7: The gaps, i.e. the upper bounds minus the lower bounds shown in Figure |6] 
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Figure 8: The average gaps based on 50 Monte Carlo runs are plotted as a function of number of selected 
sensors. 
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Figure 9: Information measure based on the selected sensors at the (/c + l)-th time step from weak to strong 
signal power of the jammer (from weak to strong correlation between sensors). 
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Figure 10: The trace of estimation error covariance at the (fc + l)-th time step from weak to strong signal 
power of the jammer (from weak to strong correlation between sensors). 
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Figure 1 1 : The sum of the weighted information measures of time steps from weak to strong signal power 
of the jammer (from weak to strong correlation between sensors). 
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Figure 12: The trace of the average estimation error covariance of N time steps from weak to strong signal 
power of the jammer (from weak to strong correlation between sensors). 
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Figure 13: RMSE of the state estimates based on 200 Monte Carlo runs. 



500 r 



400 



300- 



200 



100 



o 



° i=5 ° i=10 ° i=15 ° i=20 



n i=4 ° i=9 ° i=14 ° i=19 



° i=3 



° i=2 ° i=7 



Fusion center 
0- °i=l 



° i=13 ° i=18 



° i=12 



= 17 



° i=6 ° i=ll ° i=16 



□ The initial position 
[=25 '-^^ target 



1=24 



i=23 



i=22 



* + 



Jammer 

□ Tlie initial velocity 

i=21 (-20 m/s, Om/s) 



-100 



100 



100 



200 



300 



400 



500 



600 



700 



Figure 14: The sensor network with a jammer 
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